Reusing dynamic redistribution to eliminate cross-server operations and maintain semantics while scaling storage systems
Abstract
Distributed file systems that scale by partitioning files and directories among a collection of servers inevitably encounter cross-server operations. A common example is a rename that moves a file from a directory managed by one server to a directory managed by another. Systems that give cross-server operations the same semantics as single-server operations traditionally implement dedicated protocols for these rare operations. This thesis explores an alternative, simpler approach that exploits existing dynamic redistribution functionality (e.g., for load balancing or incorporating new servers). When a client request would involve files on multiple servers, the system can redistribute those files onto one server and have that server service the request. Although such redistribution is more expensive than a dedicated cross-server protocol, preliminary analysis of NFS traces indicates that such operations are extremely rare in file system workloads. Thus, when dynamic redistribution functionality already exists in the system, cross-server operations can be handled with very little additional implementation complexity.
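The redistribute-then-serve idea can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the thesis: the `Server` and `Cluster` classes, the `migrate` method standing in for the existing redistribution mechanism, and the choice to move whole directories rather than individual files. It only shows that a rename spanning two servers becomes a single-server operation once both directories are co-located.

```python
from dataclasses import dataclass, field


@dataclass
class Server:
    """Hypothetical server that owns a set of directories."""
    name: str
    # directory path -> set of file names stored in that directory
    dirs: dict = field(default_factory=dict)


class Cluster:
    """Toy cluster in which each directory has exactly one owning server."""

    def __init__(self, servers):
        self.servers = {s.name: s for s in servers}
        self.migrations = 0  # number of redistributions actually performed

    def owner(self, directory):
        """Return the server that currently owns `directory`."""
        for server in self.servers.values():
            if directory in server.dirs:
                return server
        raise KeyError(directory)

    def migrate(self, directory, target):
        """Stand-in for the pre-existing redistribution mechanism
        (assumed here; e.g. the one used for load balancing)."""
        source = self.owner(directory)
        if source is not target:
            target.dirs[directory] = source.dirs.pop(directory)
            self.migrations += 1

    def rename(self, src_dir, name, dst_dir, new_name):
        """Rename a file, co-locating both directories first if needed."""
        target = self.owner(dst_dir)
        # If the directories have different owners this would be a
        # cross-server operation, so redistribute instead (no-op otherwise).
        self.migrate(src_dir, target)
        # Both directories now live on `target`: an ordinary local rename.
        if name not in target.dirs[src_dir]:
            raise FileNotFoundError(name)
        target.dirs[src_dir].remove(name)
        target.dirs[dst_dir].add(new_name)
        return target.name


# Usage: "/home" starts on server A, "/archive" on server B.
a = Server("A", {"/home": {"notes.txt"}})
b = Server("B", {"/archive": set()})
cluster = Cluster([a, b])
served_by = cluster.rename("/home", "notes.txt", "/archive", "notes-old.txt")
```

In this sketch the rename is served entirely by server B after one migration. A real system would also have to handle concurrent requests and failures during migration, which the sketch ignores.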
Similar resources
Eliminating cross-server operations in scalable file systems
Distributed file systems that scale by partitioning files and directories among a collection of servers inevitably encounter cross-server operations. A common example is a RENAME that moves a file from a directory managed by one server to a directory managed by another. Systems that provide the same semantics for cross-server operations as for those that do not span servers traditionally impleme...
Eliminating Cross-server Operations in Scalable File Systems (CMU-PDL-06-105)
Distributed file systems that scale by partitioning files and directories among a collection of servers inevitably encounter cross-server operations. A common example is a RENAME that moves a file from a directory managed by one server to a directory managed by another. Systems that provide the same semantics for cross-server operations as for those that do not span servers traditionally impleme...
Dynamic Proofs of Retrievability for Coded Cloud Storage Systems
Cloud storage allows users to store their data on a remote server, avoiding expensive local storage and management costs, and then access data of interest anytime, anywhere. A number of solutions have been proposed to tackle the verification of remote data integrity and retrievability in cloud storage systems. Most existing schemes, however, do not support efficient data dynamics and/or su...
pNFS and Linux: Working Towards a Heterogeneous Future
Heterogeneous and scalable remote data access is a critical enabling feature of widely distributed collaborations. Parallel file systems feature impressive throughput, but sacrifice heterogeneous access, seamless integration, security, and cross-site performance. Remote data access tools such as NFS and GridFTP provide secure access to parallel file systems, but either lack scalability (NFS) or...
An Implicitly Scalable, Fully Interactive Multimedia Storage Server
We are developing a next generation multimedia server that provides fully interactive access to tremendous amounts and varieties of real-time and non real-time multimedia data by hundreds of simultaneous clients. Current multimedia servers are inadequate for this task given their support of only basic multimedia data types, inherently non-interactive access semantics and/or intrinsic scaling li...